Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 1000000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 107.8 MiB |
| Average record size in memory | 113.0 B |
Variable types
| NUM | 14 |
|---|---|
| CAT | 1 |
Reproduction
| Analysis started | 2020-07-31 08:04:18.952002 |
|---|---|
| Analysis finished | 2020-07-31 08:10:38.903841 |
| Duration | 6 minutes and 19.95 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
symboling
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 976.9 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | |
| -1 | |
| 3 | |
| Other values (2) | 45444 |
| Value | Count | Frequency (%) | |
| 0 | 289560 | 29.0% | |
| 1 | 273678 | 27.4% | |
| 2 | 180133 | 18.0% | |
| -1 | 124271 | 12.4% | |
| 3 | 86914 | 8.7% | |
| -2 | 31241 | 3.1% | |
| -3 | 14203 | 1.4% |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.169715 |
| Min length | 1 |
normalized-losses
Real number (ℝ≥0)
| Distinct count | 995457 |
|---|---|
| Unique (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.70330671815898 |
|---|---|
| Minimum | 40.534787 |
| Maximum | 282.388034 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 40.534787 |
|---|---|
| 5-th percentile | 74.87666125 |
| Q1 | 95.43048225 |
| median | 113.370858 |
| Q3 | 142.4856448 |
| 95-th percentile | 190.62502 |
| Maximum | 282.388034 |
| Range | 241.853247 |
| Interquartile range (IQR) | 47.0551625 |
Descriptive statistics
| Standard deviation | 35.13634235 |
|---|---|
| Coefficient of variation (CV) | 0.2910967669 |
| Kurtosis | -0.002705286066 |
| Mean | 120.7033067 |
| Median Absolute Deviation (MAD) | 22.6180315 |
| Skewness | 0.7632331042 |
| Sum | 120703306.7 |
| Variance | 1234.562553 |
| Value | Count | Frequency (%) | |
| 102.181633 | 3 | < 0.1% | |
| 114.517639 | 3 | < 0.1% | |
| 77.918621 | 3 | < 0.1% | |
| 143.0425 | 3 | < 0.1% | |
| 77.851703 | 3 | < 0.1% | |
| 93.976843 | 3 | < 0.1% | |
| 97.173023 | 3 | < 0.1% | |
| 96.152531 | 3 | < 0.1% | |
| 101.244996 | 3 | < 0.1% | |
| 115.482465 | 3 | < 0.1% | |
| Other values (995447) | 999970 | > 99.9% |
| Value | Count | Frequency (%) | |
| 40.534787 | 1 | < 0.1% | |
| 41.68983 | 1 | < 0.1% | |
| 42.158735 | 1 | < 0.1% | |
| 43.4863 | 1 | < 0.1% | |
| 45.313506 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 282.388034 | 1 | < 0.1% | |
| 273.394494 | 1 | < 0.1% | |
| 271.267575 | 1 | < 0.1% | |
| 269.739521 | 1 | < 0.1% | |
| 269.445637 | 1 | < 0.1% |
wheel-base
Real number (ℝ≥0)
| Distinct count | 958623 |
|---|---|
| Unique (%) | 95.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.06851651822105 |
|---|---|
| Minimum | 82.937917 |
| Maximum | 120.491583 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 82.937917 |
|---|---|
| 5-th percentile | 91.4276658 |
| Q1 | 94.8503675 |
| median | 96.756566 |
| Q3 | 100.353569 |
| 95-th percentile | 108.7662701 |
| Maximum | 120.491583 |
| Range | 37.553666 |
| Interquartile range (IQR) | 5.5032015 |
Descriptive statistics
| Standard deviation | 5.07963421 |
|---|---|
| Coefficient of variation (CV) | 0.05179678852 |
| Kurtosis | 0.4516140198 |
| Mean | 98.06851652 |
| Median Absolute Deviation (MAD) | 2.4060295 |
| Skewness | 0.9574328535 |
| Sum | 98068516.52 |
| Variance | 25.80268371 |
| Value | Count | Frequency (%) | |
| 92.862593 | 5 | < 0.1% | |
| 95.227388 | 5 | < 0.1% | |
| 94.939058 | 4 | < 0.1% | |
| 96.853068 | 4 | < 0.1% | |
| 96.680891 | 4 | < 0.1% | |
| 96.730316 | 4 | < 0.1% | |
| 95.406188 | 4 | < 0.1% | |
| 96.603548 | 4 | < 0.1% | |
| 97.024539 | 4 | < 0.1% | |
| 96.816832 | 4 | < 0.1% | |
| Other values (958613) | 999958 | > 99.9% |
| Value | Count | Frequency (%) | |
| 82.937917 | 1 | < 0.1% | |
| 83.602043 | 1 | < 0.1% | |
| 84.037243 | 1 | < 0.1% | |
| 84.289896 | 1 | < 0.1% | |
| 84.454539 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 120.491583 | 1 | < 0.1% | |
| 120.379685 | 1 | < 0.1% | |
| 120.018509 | 1 | < 0.1% | |
| 119.694157 | 1 | < 0.1% | |
| 119.098644 | 1 | < 0.1% |
length
Real number (ℝ≥0)
| Distinct count | 984262 |
|---|---|
| Unique (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172.2793806400221 |
|---|---|
| Minimum | 135.551814 |
| Maximum | 207.326158 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 135.551814 |
|---|---|
| 5-th percentile | 152.8070862 |
| Q1 | 166.1661565 |
| median | 172.262017 |
| Q3 | 178.966987 |
| 95-th percentile | 191.2034415 |
| Maximum | 207.326158 |
| Range | 71.774344 |
| Interquartile range (IQR) | 12.8008305 |
Descriptive statistics
| Standard deviation | 11.11342729 |
|---|---|
| Coefficient of variation (CV) | 0.06450816834 |
| Kurtosis | -0.344536897 |
| Mean | 172.2793806 |
| Median Absolute Deviation (MAD) | 6.3250765 |
| Skewness | -0.0366286563 |
| Sum | 172279380.6 |
| Variance | 123.5082661 |
| Value | Count | Frequency (%) | |
| 171.915057 | 4 | < 0.1% | |
| 172.891985 | 4 | < 0.1% | |
| 171.880064 | 4 | < 0.1% | |
| 153.043085 | 4 | < 0.1% | |
| 166.938104 | 3 | < 0.1% | |
| 172.99327 | 3 | < 0.1% | |
| 170.454238 | 3 | < 0.1% | |
| 172.776458 | 3 | < 0.1% | |
| 173.243836 | 3 | < 0.1% | |
| 173.757012 | 3 | < 0.1% | |
| Other values (984252) | 999966 | > 99.9% |
| Value | Count | Frequency (%) | |
| 135.551814 | 1 | < 0.1% | |
| 135.858935 | 1 | < 0.1% | |
| 136.157776 | 1 | < 0.1% | |
| 136.314246 | 1 | < 0.1% | |
| 136.331646 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 207.326158 | 1 | < 0.1% | |
| 206.549537 | 1 | < 0.1% | |
| 205.929732 | 1 | < 0.1% | |
| 204.795305 | 1 | < 0.1% | |
| 203.402317 | 1 | < 0.1% |
width
Real number (ℝ≥0)
| Distinct count | 890886 |
|---|---|
| Unique (%) | 89.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.55292421936302 |
|---|---|
| Minimum | 61.012905 |
| Maximum | 74.819282 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 61.012905 |
|---|---|
| 5-th percentile | 63.1373288 |
| Q1 | 64.16206375 |
| median | 65.350526 |
| Q3 | 66.387642 |
| 95-th percentile | 69.51359005 |
| Maximum | 74.819282 |
| Range | 13.806377 |
| Interquartile range (IQR) | 2.22557825 |
Descriptive statistics
| Standard deviation | 1.888488794 |
|---|---|
| Coefficient of variation (CV) | 0.02880861253 |
| Kurtosis | 0.57168452 |
| Mean | 65.55292422 |
| Median Absolute Deviation (MAD) | 1.132618 |
| Skewness | 0.9454372789 |
| Sum | 65552924.22 |
| Variance | 3.566389926 |
| Value | Count | Frequency (%) | |
| 65.386533 | 8 | < 0.1% | |
| 65.383334 | 7 | < 0.1% | |
| 65.33898 | 7 | < 0.1% | |
| 65.447525 | 6 | < 0.1% | |
| 65.422335 | 6 | < 0.1% | |
| 65.360246 | 6 | < 0.1% | |
| 65.346851 | 6 | < 0.1% | |
| 65.373219 | 6 | < 0.1% | |
| 65.333824 | 6 | < 0.1% | |
| 65.411186 | 6 | < 0.1% | |
| Other values (890876) | 999936 | > 99.9% |
| Value | Count | Frequency (%) | |
| 61.012905 | 1 | < 0.1% | |
| 61.024455 | 1 | < 0.1% | |
| 61.027899 | 1 | < 0.1% | |
| 61.075525 | 1 | < 0.1% | |
| 61.077771 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 74.819282 | 1 | < 0.1% | |
| 74.713426 | 1 | < 0.1% | |
| 74.19741 | 1 | < 0.1% | |
| 74.129775 | 1 | < 0.1% | |
| 74.102701 | 1 | < 0.1% |
height
Real number (ℝ≥0)
| Distinct count | 935857 |
|---|---|
| Unique (%) | 93.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.873614720735 |
|---|---|
| Minimum | 47.830953 |
| Maximum | 62.224857 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 47.830953 |
|---|---|
| 5-th percentile | 50.2856807 |
| Q1 | 52.3209935 |
| median | 53.946569 |
| Q3 | 55.3061815 |
| 95-th percentile | 57.8678307 |
| Maximum | 62.224857 |
| Range | 14.393904 |
| Interquartile range (IQR) | 2.985188 |
Descriptive statistics
| Standard deviation | 2.255409585 |
|---|---|
| Coefficient of variation (CV) | 0.04186482746 |
| Kurtosis | -0.4408612577 |
| Mean | 53.87361472 |
| Median Absolute Deviation (MAD) | 1.5011835 |
| Skewness | 0.1882725852 |
| Sum | 53873614.72 |
| Variance | 5.086872396 |
| Value | Count | Frequency (%) | |
| 52.274748 | 5 | < 0.1% | |
| 55.143232 | 5 | < 0.1% | |
| 53.714244 | 4 | < 0.1% | |
| 54.705473 | 4 | < 0.1% | |
| 54.84929 | 4 | < 0.1% | |
| 55.066925 | 4 | < 0.1% | |
| 52.304852 | 4 | < 0.1% | |
| 52.605388 | 4 | < 0.1% | |
| 55.490885 | 4 | < 0.1% | |
| 52.290193 | 4 | < 0.1% | |
| Other values (935847) | 999958 | > 99.9% |
| Value | Count | Frequency (%) | |
| 47.830953 | 1 | < 0.1% | |
| 47.890499 | 1 | < 0.1% | |
| 47.943158 | 1 | < 0.1% | |
| 48.024727 | 1 | < 0.1% | |
| 48.112476 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 62.224857 | 1 | < 0.1% | |
| 62.157243 | 1 | < 0.1% | |
| 62.150985 | 1 | < 0.1% | |
| 62.149963 | 1 | < 0.1% | |
| 62.074271 | 1 | < 0.1% |
curb-weight
Real number (ℝ≥0)
| Distinct count | 999672 |
|---|---|
| Unique (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2450.2383411871415 |
|---|---|
| Minimum | 1488.079382 |
| Maximum | 4365.473381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1488.079382 |
|---|---|
| 5-th percentile | 1845.645156 |
| Q1 | 2094.865045 |
| median | 2345.431832 |
| Q3 | 2749.398274 |
| 95-th percentile | 3385.17949 |
| Maximum | 4365.473381 |
| Range | 2877.393999 |
| Interquartile range (IQR) | 654.5332285 |
Descriptive statistics
| Standard deviation | 471.8533317 |
|---|---|
| Coefficient of variation (CV) | 0.1925744625 |
| Kurtosis | -0.1542935494 |
| Mean | 2450.238341 |
| Median Absolute Deviation (MAD) | 308.7316065 |
| Skewness | 0.7621834777 |
| Sum | 2450238341 |
| Variance | 222645.5666 |
| Value | Count | Frequency (%) | |
| 2316.554523 | 2 | < 0.1% | |
| 2309.546845 | 2 | < 0.1% | |
| 2035.897823 | 2 | < 0.1% | |
| 1878.244326 | 2 | < 0.1% | |
| 2750.351516 | 2 | < 0.1% | |
| 2836.872694 | 2 | < 0.1% | |
| 2020.742304 | 2 | < 0.1% | |
| 2125.350915 | 2 | < 0.1% | |
| 2183.055035 | 2 | < 0.1% | |
| 2219.263314 | 2 | < 0.1% | |
| Other values (999662) | 999980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1488.079382 | 1 | < 0.1% | |
| 1503.701279 | 1 | < 0.1% | |
| 1508.525315 | 1 | < 0.1% | |
| 1511.797107 | 1 | < 0.1% | |
| 1513.030986 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4365.473381 | 1 | < 0.1% | |
| 4300.658928 | 1 | < 0.1% | |
| 4275.329993 | 1 | < 0.1% | |
| 4268.89444 | 1 | < 0.1% | |
| 4266.851729 | 1 | < 0.1% |
engine-size
Real number (ℝ≥0)
| Distinct count | 986280 |
|---|---|
| Unique (%) | 98.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 117.98050930962303 |
|---|---|
| Minimum | 47.038906 |
| Maximum | 270.101636 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 47.038906 |
|---|---|
| 5-th percentile | 85.8324626 |
| Q1 | 97.334543 |
| median | 109.1238825 |
| Q3 | 127.555666 |
| 95-th percentile | 183.2597272 |
| Maximum | 270.101636 |
| Range | 223.06273 |
| Interquartile range (IQR) | 30.221123 |
Descriptive statistics
| Standard deviation | 29.56153817 |
|---|---|
| Coefficient of variation (CV) | 0.2505628967 |
| Kurtosis | 1.339278696 |
| Mean | 117.9805093 |
| Median Absolute Deviation (MAD) | 13.5551745 |
| Skewness | 1.342225846 |
| Sum | 117980509.3 |
| Variance | 873.884539 |
| Value | Count | Frequency (%) | |
| 108.370474 | 4 | < 0.1% | |
| 108.349055 | 4 | < 0.1% | |
| 109.043763 | 4 | < 0.1% | |
| 109.193469 | 4 | < 0.1% | |
| 99.50827 | 4 | < 0.1% | |
| 109.477626 | 4 | < 0.1% | |
| 97.339492 | 3 | < 0.1% | |
| 107.813734 | 3 | < 0.1% | |
| 109.009979 | 3 | < 0.1% | |
| 108.445035 | 3 | < 0.1% | |
| Other values (986270) | 999964 | > 99.9% |
| Value | Count | Frequency (%) | |
| 47.038906 | 1 | < 0.1% | |
| 52.63709 | 1 | < 0.1% | |
| 53.738905 | 1 | < 0.1% | |
| 55.538281 | 1 | < 0.1% | |
| 56.875476 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 270.101636 | 1 | < 0.1% | |
| 267.26822 | 1 | < 0.1% | |
| 267.000231 | 1 | < 0.1% | |
| 266.23328 | 1 | < 0.1% | |
| 266.201796 | 1 | < 0.1% |
bore
Real number (ℝ≥0)
| Distinct count | 612985 |
|---|---|
| Unique (%) | 61.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3017691945530006 |
|---|---|
| Minimum | 2.596391 |
| Maximum | 4.051284 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 2.596391 |
|---|---|
| 5-th percentile | 2.908112 |
| Q1 | 3.10495 |
| median | 3.250946 |
| Q3 | 3.52834875 |
| 95-th percentile | 3.73413105 |
| Maximum | 4.051284 |
| Range | 1.454893 |
| Interquartile range (IQR) | 0.42339875 |
Descriptive statistics
| Standard deviation | 0.2628983532 |
|---|---|
| Coefficient of variation (CV) | 0.07962347992 |
| Kurtosis | -0.9976579628 |
| Mean | 3.301769195 |
| Median Absolute Deviation (MAD) | 0.201378 |
| Skewness | 0.2064503862 |
| Sum | 3301769.195 |
| Variance | 0.06911554409 |
| Value | Count | Frequency (%) | |
| 3.151921 | 10 | < 0.1% | |
| 3.14885 | 10 | < 0.1% | |
| 3.208548 | 9 | < 0.1% | |
| 3.254682 | 9 | < 0.1% | |
| 3.148043 | 9 | < 0.1% | |
| 3.146852 | 9 | < 0.1% | |
| 3.12833 | 9 | < 0.1% | |
| 3.184692 | 9 | < 0.1% | |
| 3.235669 | 9 | < 0.1% | |
| 3.213708 | 9 | < 0.1% | |
| Other values (612975) | 999908 | > 99.9% |
| Value | Count | Frequency (%) | |
| 2.596391 | 1 | < 0.1% | |
| 2.598211 | 1 | < 0.1% | |
| 2.604898 | 1 | < 0.1% | |
| 2.610413 | 1 | < 0.1% | |
| 2.610481 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4.051284 | 1 | < 0.1% | |
| 4.051191 | 1 | < 0.1% | |
| 4.033533 | 1 | < 0.1% | |
| 4.033516 | 1 | < 0.1% | |
| 4.033015 | 1 | < 0.1% |
stroke
Real number (ℝ≥0)
| Distinct count | 598812 |
|---|---|
| Unique (%) | 59.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.239506486957 |
|---|---|
| Minimum | 1.710764 |
| Maximum | 4.184797 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1.710764 |
|---|---|
| 5-th percentile | 2.6147529 |
| Q1 | 3.11806875 |
| median | 3.278242 |
| Q3 | 3.421419 |
| 95-th percentile | 3.67668615 |
| Maximum | 4.184797 |
| Range | 2.474033 |
| Interquartile range (IQR) | 0.30335025 |
Descriptive statistics
| Standard deviation | 0.30054921 |
|---|---|
| Coefficient of variation (CV) | 0.09277623342 |
| Kurtosis | 1.144027532 |
| Mean | 3.239506487 |
| Median Absolute Deviation (MAD) | 0.149862 |
| Skewness | -0.8735938003 |
| Sum | 3239506.487 |
| Variance | 0.09032982763 |
| Value | Count | Frequency (%) | |
| 3.418763 | 13 | < 0.1% | |
| 3.404794 | 13 | < 0.1% | |
| 3.418811 | 12 | < 0.1% | |
| 3.401461 | 12 | < 0.1% | |
| 3.404108 | 12 | < 0.1% | |
| 3.404311 | 11 | < 0.1% | |
| 3.427737 | 11 | < 0.1% | |
| 3.413743 | 11 | < 0.1% | |
| 3.430678 | 11 | < 0.1% | |
| 3.409931 | 11 | < 0.1% | |
| Other values (598802) | 999883 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1.710764 | 1 | < 0.1% | |
| 1.715495 | 1 | < 0.1% | |
| 1.71737 | 1 | < 0.1% | |
| 1.729503 | 1 | < 0.1% | |
| 1.763756 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4.184797 | 1 | < 0.1% | |
| 4.171517 | 1 | < 0.1% | |
| 4.168628 | 1 | < 0.1% | |
| 4.166581 | 1 | < 0.1% | |
| 4.164488 | 1 | < 0.1% |
compression-ratio
Real number (ℝ)
| Distinct count | 649974 |
|---|---|
| Unique (%) | 65.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.116499157775996 |
|---|---|
| Minimum | -12.104269 |
| Maximum | 43.255678 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -12.104269 |
|---|---|
| 5-th percentile | 7.5036708 |
| Q1 | 8.6460055 |
| median | 9 |
| Q3 | 9.35974625 |
| 95-th percentile | 19.70514305 |
| Maximum | 43.255678 |
| Range | 55.359947 |
| Interquartile range (IQR) | 0.71374075 |
Descriptive statistics
| Standard deviation | 3.908532595 |
|---|---|
| Coefficient of variation (CV) | 0.3863522879 |
| Kurtosis | 7.636462709 |
| Mean | 10.11649916 |
| Median Absolute Deviation (MAD) | 0.3571715 |
| Skewness | 2.591073397 |
| Sum | 10116499.16 |
| Variance | 15.27662705 |
| Value | Count | Frequency (%) | |
| 9 | 223288 | 22.3% | |
| 9.367976 | 8 | < 0.1% | |
| 8.679094 | 8 | < 0.1% | |
| 9.344802 | 8 | < 0.1% | |
| 8.707704 | 7 | < 0.1% | |
| 9.351552 | 7 | < 0.1% | |
| 9.293021 | 7 | < 0.1% | |
| 9.348654 | 7 | < 0.1% | |
| 8.704204 | 7 | < 0.1% | |
| 9.313152 | 7 | < 0.1% | |
| Other values (649964) | 776646 | 77.7% |
| Value | Count | Frequency (%) | |
| -12.104269 | 1 | < 0.1% | |
| -10.428191 | 1 | < 0.1% | |
| -10.295051 | 1 | < 0.1% | |
| -10.159511 | 1 | < 0.1% | |
| -10.018769 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 43.255678 | 1 | < 0.1% | |
| 42.114537 | 1 | < 0.1% | |
| 41.55852 | 1 | < 0.1% | |
| 41.124101 | 1 | < 0.1% | |
| 40.632899 | 1 | < 0.1% |
horsepower
Real number (ℝ≥0)
| Distinct count | 993654 |
|---|---|
| Unique (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.22985335323202 |
|---|---|
| Minimum | 36.841507 |
| Maximum | 232.513224 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 36.841507 |
|---|---|
| 5-th percentile | 59.8742576 |
| Q1 | 71.392133 |
| median | 89.191517 |
| Q3 | 111.3747885 |
| 95-th percentile | 157.9234415 |
| Maximum | 232.513224 |
| Range | 195.671717 |
| Interquartile range (IQR) | 39.9826555 |
Descriptive statistics
| Standard deviation | 29.84007262 |
|---|---|
| Coefficient of variation (CV) | 0.3133478795 |
| Kurtosis | 0.3936710173 |
| Mean | 95.22985335 |
| Median Absolute Deviation (MAD) | 19.5609885 |
| Skewness | 0.9615700689 |
| Sum | 95229853.35 |
| Variance | 890.4299338 |
| Value | Count | Frequency (%) | |
| 74.83921 | 3 | < 0.1% | |
| 70.714781 | 3 | < 0.1% | |
| 96.086838 | 3 | < 0.1% | |
| 76.434991 | 3 | < 0.1% | |
| 74.677608 | 3 | < 0.1% | |
| 90.695519 | 3 | < 0.1% | |
| 72.107417 | 3 | < 0.1% | |
| 88.383343 | 3 | < 0.1% | |
| 67.683882 | 3 | < 0.1% | |
| 115.796159 | 3 | < 0.1% | |
| Other values (993644) | 999970 | > 99.9% |
| Value | Count | Frequency (%) | |
| 36.841507 | 1 | < 0.1% | |
| 37.989707 | 1 | < 0.1% | |
| 39.100047 | 1 | < 0.1% | |
| 39.426082 | 1 | < 0.1% | |
| 39.759405 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 232.513224 | 1 | < 0.1% | |
| 231.766237 | 1 | < 0.1% | |
| 226.677513 | 1 | < 0.1% | |
| 224.542844 | 1 | < 0.1% | |
| 223.739673 | 1 | < 0.1% |
peak-rpm
Real number (ℝ≥0)
| Distinct count | 795296 |
|---|---|
| Unique (%) | 79.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5059.449083098581 |
|---|---|
| Minimum | 3576.659759 |
| Maximum | 6975.934291 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 3576.659759 |
|---|---|
| 5-th percentile | 4246.949641 |
| Q1 | 4800 |
| median | 5107.621803 |
| Q3 | 5355.311259 |
| 95-th percentile | 5847.253443 |
| Maximum | 6975.934291 |
| Range | 3399.274532 |
| Interquartile range (IQR) | 555.3112593 |
Descriptive statistics
| Standard deviation | 468.8905008 |
|---|---|
| Coefficient of variation (CV) | 0.09267619718 |
| Kurtosis | -0.3372159325 |
| Mean | 5059.449083 |
| Median Absolute Deviation (MAD) | 307.621803 |
| Skewness | -0.04756327049 |
| Sum | 5059449083 |
| Variance | 219858.3018 |
| Value | Count | Frequency (%) | |
| 4800 | 204432 | 20.4% | |
| 5185.723765 | 2 | < 0.1% | |
| 4431.635038 | 2 | < 0.1% | |
| 5333.175511 | 2 | < 0.1% | |
| 5853.162398 | 2 | < 0.1% | |
| 5106.028835 | 2 | < 0.1% | |
| 5091.000817 | 2 | < 0.1% | |
| 5311.804377 | 2 | < 0.1% | |
| 5278.714461 | 2 | < 0.1% | |
| 5052.780838 | 2 | < 0.1% | |
| Other values (795286) | 795550 | 79.6% |
| Value | Count | Frequency (%) | |
| 3576.659759 | 1 | < 0.1% | |
| 3591.985613 | 1 | < 0.1% | |
| 3627.490818 | 1 | < 0.1% | |
| 3627.875879 | 1 | < 0.1% | |
| 3628.07352 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6975.934291 | 1 | < 0.1% | |
| 6950.962767 | 1 | < 0.1% | |
| 6779.007737 | 1 | < 0.1% | |
| 6774.902314 | 1 | < 0.1% | |
| 6772.684643 | 1 | < 0.1% |
city-mpg
Real number (ℝ≥0)
| Distinct count | 971550 |
|---|---|
| Unique (%) | 97.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.530282290655002 |
|---|---|
| Minimum | 9.229116 |
| Maximum | 58.519152 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 9.229116 |
|---|---|
| 5-th percentile | 17.74742855 |
| Q1 | 22.9404365 |
| median | 26.1711805 |
| Q3 | 29.58167275 |
| 95-th percentile | 37.7468223 |
| Maximum | 58.519152 |
| Range | 49.290036 |
| Interquartile range (IQR) | 6.64123625 |
Descriptive statistics
| Standard deviation | 5.877974145 |
|---|---|
| Coefficient of variation (CV) | 0.2215571655 |
| Kurtosis | 0.09416799495 |
| Mean | 26.53028229 |
| Median Absolute Deviation (MAD) | 3.311382 |
| Skewness | 0.5314613908 |
| Sum | 26530282.29 |
| Variance | 34.55058005 |
| Value | Count | Frequency (%) | |
| 25.923419 | 4 | < 0.1% | |
| 26.564778 | 4 | < 0.1% | |
| 26.369762 | 4 | < 0.1% | |
| 26.555285 | 4 | < 0.1% | |
| 29.510816 | 4 | < 0.1% | |
| 27.397433 | 4 | < 0.1% | |
| 26.352379 | 4 | < 0.1% | |
| 23.093315 | 4 | < 0.1% | |
| 23.903483 | 4 | < 0.1% | |
| 25.914599 | 4 | < 0.1% | |
| Other values (971540) | 999960 | > 99.9% |
| Value | Count | Frequency (%) | |
| 9.229116 | 1 | < 0.1% | |
| 12.38302 | 1 | < 0.1% | |
| 12.604281 | 1 | < 0.1% | |
| 12.625376 | 1 | < 0.1% | |
| 12.626523 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 58.519152 | 1 | < 0.1% | |
| 53.644571 | 1 | < 0.1% | |
| 53.236547 | 1 | < 0.1% | |
| 52.306984 | 1 | < 0.1% | |
| 51.819629 | 1 | < 0.1% |
highway-mpg
Real number (ℝ≥0)
| Distinct count | 972668 |
|---|---|
| Unique (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.01049234188199 |
|---|---|
| Minimum | 14.015803 |
| Maximum | 61.303145 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 14.015803 |
|---|---|
| 5-th percentile | 22.3859976 |
| Q1 | 28.28621525 |
| median | 31.910125 |
| Q3 | 35.48576275 |
| 95-th percentile | 43.8822787 |
| Maximum | 61.303145 |
| Range | 47.287342 |
| Interquartile range (IQR) | 7.1995475 |
Descriptive statistics
| Standard deviation | 6.156265082 |
|---|---|
| Coefficient of variation (CV) | 0.1923202248 |
| Kurtosis | 0.3881314078 |
| Mean | 32.01049234 |
| Median Absolute Deviation (MAD) | 3.6015115 |
| Skewness | 0.4990222171 |
| Sum | 32010492.34 |
| Variance | 37.89959976 |
| Value | Count | Frequency (%) | |
| 31.917286 | 5 | < 0.1% | |
| 28.61701 | 5 | < 0.1% | |
| 29.139959 | 4 | < 0.1% | |
| 32.198521 | 4 | < 0.1% | |
| 32.600114 | 4 | < 0.1% | |
| 32.593874 | 4 | < 0.1% | |
| 29.483336 | 4 | < 0.1% | |
| 31.930899 | 4 | < 0.1% | |
| 37.124402 | 4 | < 0.1% | |
| 29.356631 | 4 | < 0.1% | |
| Other values (972658) | 999958 | > 99.9% |
| Value | Count | Frequency (%) | |
| 14.015803 | 1 | < 0.1% | |
| 15.573406 | 1 | < 0.1% | |
| 15.750687 | 1 | < 0.1% | |
| 15.759329 | 1 | < 0.1% | |
| 15.837289 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 61.303145 | 1 | < 0.1% | |
| 60.718755 | 1 | < 0.1% | |
| 60.651686 | 1 | < 0.1% | |
| 60.566407 | 1 | < 0.1% | |
| 60.487042 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| symboling | normalized-losses | wheel-base | length | width | height | curb-weight | engine-size | bore | stroke | compression-ratio | horsepower | peak-rpm | city-mpg | highway-mpg | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 145.906176 | 90.576742 | 164.257910 | 65.339016 | 55.257411 | 2119.914136 | 108.063442 | 3.659766 | 3.413379 | 7.431662 | 126.769640 | 4800.000000 | 16.998459 | 21.094419 |
| 1 | -1 | 92.733744 | 105.905393 | 187.256268 | 68.851686 | 53.507441 | 3086.763228 | 89.138707 | 3.693169 | 3.436936 | 9.335766 | 85.199136 | 5954.208367 | 16.835254 | 23.957619 |
| 2 | 0 | 87.620269 | 96.992974 | 172.988044 | 65.355847 | 55.275391 | 2353.623357 | 109.261573 | 3.481147 | 3.409794 | 8.625902 | 64.963110 | 5207.825450 | 27.964604 | 32.979688 |
| 3 | 2 | 107.851112 | 96.083754 | 166.746214 | 65.499869 | 51.568524 | 3117.821531 | 99.979531 | 3.598569 | 3.461714 | 10.262772 | 77.154984 | 4076.997829 | 26.991856 | 31.718865 |
| 4 | 3 | 149.361994 | 99.346480 | 178.441091 | 66.346505 | 51.047962 | 2602.065082 | 197.328140 | 3.069738 | 3.324375 | 9.444737 | 133.745785 | 4800.000000 | 14.244346 | 22.570794 |
| 5 | 2 | 98.334510 | 96.406994 | 183.504849 | 64.731031 | 57.999769 | 2396.758680 | 97.458361 | 3.576548 | 3.408376 | 12.929740 | 67.933822 | 4800.000000 | 39.488866 | 28.457586 |
| 6 | 1 | 96.619522 | 94.684346 | 150.833722 | 63.185266 | 52.068419 | 2050.941815 | 90.226273 | 3.045585 | 3.345146 | 9.362339 | 64.379631 | 5954.970776 | 28.017328 | 28.402865 |
| 7 | 1 | 117.263816 | 99.025148 | 152.619765 | 68.666973 | 56.014758 | 1783.952625 | 131.805892 | 2.906997 | 3.417757 | 8.267729 | 159.537591 | 5288.499176 | 24.071256 | 33.062782 |
| 8 | 2 | 99.157131 | 96.708796 | 180.172684 | 63.930420 | 52.796516 | 2760.344910 | 99.853330 | 3.167853 | 3.303717 | 9.264914 | 147.308582 | 4800.000000 | 17.041798 | 25.763443 |
| 9 | 2 | 131.848350 | 93.529253 | 159.268335 | 63.470886 | 52.520844 | 2019.713633 | 92.211237 | 2.874809 | 3.404221 | 9.000000 | 70.983376 | 5279.593350 | 29.332414 | 36.756228 |
Last rows
| symboling | normalized-losses | wheel-base | length | width | height | curb-weight | engine-size | bore | stroke | compression-ratio | horsepower | peak-rpm | city-mpg | highway-mpg | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 999990 | 1 | 115.402159 | 90.570079 | 163.839023 | 64.445463 | 54.529805 | 1908.596845 | 87.894701 | 2.889040 | 2.828961 | 9.000000 | 54.495953 | 4800.000000 | 40.967622 | 41.850046 |
| 999991 | -1 | 131.995869 | 96.812853 | 170.940698 | 65.397087 | 52.633467 | 3409.854862 | 109.116760 | 3.154569 | 3.368657 | 8.628101 | 114.032494 | 4087.143614 | 19.244789 | 27.106228 |
| 999992 | -1 | 126.874412 | 103.744737 | 172.070084 | 64.225101 | 55.864744 | 2280.330639 | 97.591499 | 3.127087 | 3.308171 | 9.000000 | 70.595923 | 4800.000000 | 29.085576 | 35.831991 |
| 999993 | 1 | 114.341019 | 95.291598 | 170.935406 | 64.245729 | 50.630898 | 2015.355497 | 97.459511 | 3.339956 | 3.174657 | 8.660770 | 74.109929 | 5459.688970 | 34.438976 | 43.471154 |
| 999994 | 1 | 164.528877 | 89.452779 | 158.822803 | 63.923670 | 54.639922 | 2099.041447 | 97.797166 | 3.497598 | 2.914149 | 9.000000 | 108.367333 | 4800.000000 | 22.674282 | 30.037071 |
| 999995 | 1 | 131.889643 | 94.683822 | 152.600348 | 63.753260 | 55.058248 | 1898.035998 | 94.051597 | 3.135750 | 2.333443 | 9.000000 | 64.637229 | 5215.317878 | 39.611105 | 32.011279 |
| 999996 | 1 | 111.525916 | 94.187341 | 164.683562 | 64.375117 | 50.068405 | 1838.700404 | 86.107942 | 3.260747 | 3.388072 | 7.344748 | 63.710452 | 5356.350668 | 41.574036 | 36.153130 |
| 999997 | 3 | 142.307625 | 111.712965 | 189.704961 | 68.347307 | 52.333088 | 2750.380050 | 166.112965 | 3.166962 | 3.328309 | 9.272152 | 146.966438 | 5207.106730 | 29.562491 | 32.558169 |
| 999998 | 1 | 83.784384 | 92.137350 | 188.034886 | 65.029247 | 52.375108 | 2322.327103 | 101.061642 | 3.189305 | 3.054712 | 8.763895 | 68.864198 | 5201.943517 | 24.321078 | 36.973930 |
| 999999 | 2 | 101.334427 | 108.081357 | 171.797371 | 63.758830 | 55.065116 | 2122.465426 | 89.663024 | 2.846550 | 3.396919 | 8.020933 | 67.234099 | 5226.270626 | 27.434386 | 34.196120 |